Designing Energy-Efficient Fetch Engines

Authors

  • James Cohoon
  • Jack W. Davidson
  • John Lach
  • Christopher W. Milner
Abstract

This dissertation evaluates factors that affect the energy-efficiency of the fetch engine in a programmable uniprocessor. The central thesis is that branch prediction is one of the key factors affecting overall processor energy-efficiency. Cooling costs, battery life in mobile devices, and utility costs for wall-powered systems, especially data centers, are growing concerns. A better understanding of the factors that affect overall processor energy-efficiency has therefore become important for virtually all new processor designs. For a processor to be energy-efficient, power consumption and performance must be balanced. The fetch unit is key to the processor’s execution efficiency because it predicts and fetches all the instructions sent to the execution units. When fetching is stalled or is following an incorrectly predicted path, the rest of the processor cannot perform useful work, yet it continues to consume energy. Many different fetch engine designs have been proposed in the research community, and understanding how the fetch organization affects the processor’s overall energy-efficiency is essential. For example, previous research has shown that trace caches can yield significant improvement in instruction throughput, but it is unclear whether trace caches are more energy-efficient than traditional instruction caches. Similarly, branch predictors improve fetch bandwidth by increasing instruction-level parallelism (ILP), but their energy-efficiency has not previously been systematically evaluated.

Related resources

Latency Tolerant Branch Predictors

The access latency of branch predictors is a well known problem of fetch engine design. Prediction overriding techniques are commonly accepted to overcome this problem. However, prediction overriding requires a complex recovery mechanism to discard the wrong speculative work based on overridden predictions. In this paper, we show that stream and trace predictors, which use long basic prediction...

An Accurate and Energy-Efficient Way Determination Technique for Instruction Caches by Using Early Tag Matching

Energy consumption has become an important design consideration in modern processors. Therefore, microarchitects should consider energy consumption, together with performance, when designing the cache architecture, since it is a major power consumer in a processor. This paper proposes an accurate and energy-efficient way determination (instead of prediction) technique for reducing energy consum...

Energy-Efficient Simultaneous Thread Fetch from Different Cache Levels in a Soft Real-Time SMT Processor

This paper focuses on the instruction fetch resources in a real-time SMT processor to provide an energy-efficient configuration for a soft real-time application running as a high priority thread as fast as possible while still offering decent progress in low priority or non-realtime thread(s). We propose a fetch mechanism, Fetch-around, where a high priority thread accesses the L1 ICache, and l...

Using a serial cache for energy efficient instruction fetching

Computer Science Department, University of California, Los Angeles; Department of Computer Science and Engineering, University of California, San Diego

The design of a high performance fetch architecture can be challenging due to poor interconnect scaling and energy concerns. Way prediction has been presented as one means of scaling the fetch engine to shorter cycle times, while providi...

On the Performance of Fetch Engines Running DSS Workloads

This paper examines the behavior of current and next generation microprocessors’ fetch engines while running Decision Support Systems (DSS) workloads. We analyze the effect of the latency of instructions being fetched, their quality and the number of instructions that the fetch engine provides per access. Our study reveals that a well dimensioned fetch engine is of great importance for DSS perf...

Publication date: 2006